CDS

Accession Number TCMCG075C15105
gbkey CDS
Protein Id XP_007026515.1
Location join(120465..120650,120865..121044,121749..121855,122067..122272,122775..122914,123220..123528)
Gene LOC18597417
GeneID 18597417
Organism Theobroma cacao
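
Note on Location: the join(...) value lists six exon intervals. Assuming the usual GenBank feature-location convention (1-based, inclusive coordinates), the summed interval lengths give the CDS length, and the codon count minus the stop codon should agree with the protein length reported in this record. A minimal sketch of that check follows; the helper names exon_intervals and cds_length are illustrative only and not part of any database API.

```python
import re

# Location string copied from the CDS block of this record.
LOCATION = ("join(120465..120650,120865..121044,121749..121855,"
            "122067..122272,122775..122914,123220..123528)")

def exon_intervals(location):
    """Return (start, end) pairs from a simple join(a..b,c..d) location string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def cds_length(location):
    """Sum exon lengths, treating coordinates as 1-based and inclusive (assumption)."""
    return sum(end - start + 1 for start, end in exon_intervals(location))

total_nt = cds_length(LOCATION)
codons = total_nt // 3
# One codon is the stop codon, so the protein is one residue shorter.
print(total_nt, codons - 1)  # prints: 1128 375
```

The result (1128 nt, 375 residues) is consistent with the Length field in the Protein block.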

Protein

Length 375aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007026453.2
Definition PREDICTED: biotin synthase [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category H
Description biotin synthase
KEGG_TC -
KEGG_Module M00123, M00573, M00577
KEGG_Reaction R01078
KEGG_rclass RC00441
BRITE ko00000, ko00001, ko00002, ko01000
KEGG_ko ko:K01012
EC 2.8.1.6
KEGG_Pathway ko00780, ko01100, map00780, map01100
GOs GO:0003674, GO:0005488, GO:0008270, GO:0043167, GO:0043169, GO:0046872, GO:0046914

Sequence

CDS:  
ATGTTGTCCATTCGATCCCCTGCGGCTCAGCTGCTGCTCAGAAGTTGTAGTTTCTACTCTACCACTGCATCGGCTGCAGCTGTGGAAGCTGAGAGAACTATCCGAGAAGGCCCTAGAAACGACTGGACACGCCAGCAGATCAAGTCCATCTACGATTCTCCTGTTCTTGATCTCCTTTTCCACGGAGCTCAAGTTCACAGATATGCTCATAACTTTCGAGAGGTGCAGCAGTGTACCCTCCTCTCAATCAAGACTGGTGGATGCAGTGAGGATTGTTCATACTGTCCTCAGTCCTCAAGGTATCATACAGGGCTAAAGGCCCAAAAGCTTATGACTAAGGAAGCTGTAATGCAGGCAGCTAAACAGGCAAAAGAGGCTGGTAGTACACGCTTTTGCATGGGTGCTGCATGGAGAGACACTGTAGGAAGGAAAACTAACTTCAACCAAATTCTTGAATATGTAAAACAAATTAGGGATATGGGAATGGAGGTGTGTTGCACTTTAGGCATGCTAGAGAAGCAGCAGGCTCTTGAACTCAAGAAGGCTGGCCTTACAGCTTACAACCATAATCTTGATACCTCAAGAGAATATTACCCAAACATTATTACAACCAGAACTTATGATGAGCGATTGGAAACCCTTCAACATGTCCGTGAAGCAGGAATTAATGTCTGTTCAGGTGGAATTATAGGGCTTGGAGAAGCAGAGGAGGACCGGGTTGGCTTGTTGCACACGCTGGCAACGCTTCCCTCTCACCCAGAGAGTGTTCCCATTAATGCCTTGGTTGCAGTGAAAGGCACGCCGCTTCAAGATCAAAAGCCTGTTGAAATATGGGAGATGATACGAATGATTGCAACTGCTCGTATAGCCATGCCAAAATCAATGGTAAGGTTGTCAGCTGGCAGAGTTCGATTCTCCATGCCTGAGCAGGCATTATGTTTTCTTGCCGGAGCAAATTCCATCTTCACTGGTGAGAAGCTGTTGACAACTCCTAACAATGATTTTGATGCTGATCAACTTATGTTCAAAATCCTTGGGCTGATTCCAAAAGCTCCAAGCTTTTCCGAGGAAGCAGCAAAGACTTCCGAAGCAGAAAACTGTGAGGAAGCTCTCTCAAGTTCGGGTTGA
Protein:  
MLSIRSPAAQLLLRSCSFYSTTASAAAVEAERTIREGPRNDWTRQQIKSIYDSPVLDLLFHGAQVHRYAHNFREVQQCTLLSIKTGGCSEDCSYCPQSSRYHTGLKAQKLMTKEAVMQAAKQAKEAGSTRFCMGAAWRDTVGRKTNFNQILEYVKQIRDMGMEVCCTLGMLEKQQALELKKAGLTAYNHNLDTSREYYPNIITTRTYDERLETLQHVREAGINVCSGGIIGLGEAEEDRVGLLHTLATLPSHPESVPINALVAVKGTPLQDQKPVEIWEMIRMIATARIAMPKSMVRLSAGRVRFSMPEQALCFLAGANSIFTGEKLLTTPNNDFDADQLMFKILGLIPKAPSFSEEAAKTSEAENCEEALSSSG
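
Note on the sequences: the protein sequence should be the conceptual translation of the CDS. The sketch below shows how that relationship could be checked, assuming the standard genetic code (NCBI translation table 1); the record itself does not state which table was used. Only the first 18 and last 9 nucleotides of the CDS are used here to keep the example short, and the function name translate is illustrative.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a nucleotide string codon by codon; unknown codons become 'X'."""
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(c.upper(), "X") for c in codons)

print(translate("ATGTTGTCCATTCGATCC"))  # first six codons: MLSIRS
print(translate("TCGGGTTGA"))           # last three codons: SG*
```

The first six residues (MLSIRS) and the final residues (SG, followed by the stop codon) match the start and end of the protein sequence listed above.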